We study a nonzero-sum game of two players which is a generalization of the antagonistic noisy duel of discrete type. The game is considered from the point of view of various criteria of optimality. We prove the existence of $\varepsilon$-equilibrium situations and show that the $\varepsilon$-equilibrium strategies that we have found are $\varepsilon$-maxmin. Conditions under which the equilibrium plays are Pareto-optimal are given.

Keywords: noisy duel, payoff function, strategy, equilibrium situation, Pareto optimality, the value of a game.

1 Introduction

The classical duel is a zero-sum game of two players of the following type. The players have certain resources and use them during a given time interval with the goal of achieving success. Use of the resource $\gamma$ at the moment $t$ leads to success with a probability depending only on the amount of resource $\gamma$ and the time $t$ (it is usually assumed that the probability of success increases with time). As soon as one player achieves the goal he receives his profit, which is equal to his opponent's loss, and the game ends. Various assumptions about the ways players use their resources and about the information players receive about the opponent's behavior during the game define various kinds of duels [H |2]. Models were considered where the players' resources were discrete (discrete firing duels), infinitely divisible (continuous firing duels), or continuous for one of the players and discrete for the other one (mixed duels, or fighter-bomber duels) [51 E]. Researchers studied noisy duels [SIEIE], where every player at a given moment of time had complete information about his opponent's behavior up to that moment, and silent duels, where no such information was available. At present duels are considered as classical models of competition [H [S]. However, their application as such is somewhat limited by the assumption that the players' interests are strictly opposite to each other. Games of the duel type with nonzero sum belong to an unexplored class of infinite games with nonopposite interests.

A nonzero-sum game which is a generalization of the classical antagonistic duel was first considered in [S]; it was a nonantagonistic noisy fighter-bomber duel. Then nonzero-sum duels were studied in [TU], [TT], [T^.

In this paper we study a nonzero-sum game of two players which is a generalization of the antagonistic noisy duel with discrete firing [3l H]. This paper is an extended version of [13] with complete proofs. It is also a sequel to III].

The author is grateful to Leonid Positselski for his help in translating this paper into English and editing it.

2 Preliminaries from Game Theory 

A game of two players is a quadruple

\[
\Gamma = \{X, Y, K_1(x,y), K_2(x,y)\},
\]

where $X$ and $Y$ are the sets of the players' strategies and $K_j(x,y)$, $j = 1, 2$, are the players' payoff functions, which are defined on the Cartesian product $X \times Y$ and determine the $j$-th player's payoff when Player I uses a strategy $x \in X$ and Player II uses a strategy $y \in Y$. A game $\Gamma$ is called a zero-sum game of two players, or an antagonistic game, if $K_1(x,y) + K_2(x,y) = 0$ for all $x \in X$, $y \in Y$, and a nonzero-sum game of two players, or a nonantagonistic game, otherwise. The mixed extension of a game $\Gamma$ is the game $\bar\Gamma = (\Phi, \Psi, \bar K(\varphi,\psi))$, where $\Phi$ and $\Psi$ are the sets of distributions on $X$ and $Y$ and $\bar K_j(\varphi,\psi)$ ($j = 1, 2$) are the mean values of the payoff functions $K_j(x,y)$ over the distributions $\varphi \in \Phi$, $\psi \in \Psi$. Pure strategies of the game $\bar\Gamma$ are distributions concentrated in one point $x \in X$ or $y \in Y$, respectively. All the other distributions $\varphi \in \Phi$, $\psi \in \Psi$ are called mixed strategies.
A situation $(x,y)$ is a pair of the players' strategies. A situation $(x_e,y_e)$ is called an equilibrium situation if for all $x \in X$, $y \in Y$ the following inequalities hold:

\[
K_1(x,y_e) \le K_1(x_e,y_e); \quad K_2(x_e,y) \le K_2(x_e,y_e). \tag{1}
\]

The vector $(v^1,v^2)$, where $v^1 = K_1(x_e,y_e)$ and $v^2 = K_2(x_e,y_e)$, is called the equilibrium value corresponding to the equilibrium situation $(x_e,y_e)$, and a strategy included in some equilibrium situation is called an equilibrium strategy.

An infinite game with nonantagonistic interests may not admit an equilibrium situation. A situation $(x^\varepsilon,y^\varepsilon)$ is called an $\varepsilon$-equilibrium situation if for any $x \in X$, $y \in Y$ the following inequalities hold:

\[
K_1(x,y^\varepsilon) - \varepsilon \le K_1(x^\varepsilon,y^\varepsilon); \quad K_2(x^\varepsilon,y) - \varepsilon \le K_2(x^\varepsilon,y^\varepsilon). \tag{2}
\]

A player's strategy included in some $\varepsilon$-equilibrium situation will be called an $\varepsilon$-equilibrium strategy. If the limits

\[
v^j = \lim_{\varepsilon \to 0} K_j(x^\varepsilon,y^\varepsilon) \quad (j = 1, 2)
\]

exist, the vector $(v^1,v^2)$ will be called the equilibrium value corresponding to the set of $\varepsilon$-equilibrium situations $\{(x^\varepsilon,y^\varepsilon)\}$.

A strategy $x_m \in X$ is called a maxmin strategy of Player I if the function $m_1(x) = \inf_{y \in Y} K_1(x,y)$ attains its maximal value at it. Analogously, a strategy $y_m \in Y$ is called a maxmin strategy of Player II if the function $m_2(y) = \inf_{x \in X} K_2(x,y)$ attains its maximal value at it. The vector

\[
w = (w^1,w^2), \quad\text{where}\quad w^1 = \max_{x \in X}\inf_{y \in Y} K_1(x,y), \quad w^2 = \max_{y \in Y}\inf_{x \in X} K_2(x,y),
\]

is called the maxmin value of a nonzero-sum game. The value $w^j$ is the best guaranteed payoff of the $j$-th player.

In an infinite game maxmin strategies may not exist. In this case $\varepsilon$-maxmin strategies are considered. A strategy $x^\varepsilon \in X$ is called an $\varepsilon$-maxmin strategy of Player I if

\[
m_1(x^\varepsilon) > w^1 - \varepsilon, \quad\text{where}\quad w^1 = \sup_{x \in X}\inf_{y \in Y} K_1(x,y),
\]

and $w^1$ is called the maxmin value of the game for Player I. Analogously one defines an $\varepsilon$-maxmin strategy of Player II: a strategy $y^\varepsilon \in Y$ is called an $\varepsilon$-maxmin strategy of Player II if

\[
m_2(y^\varepsilon) > w^2 - \varepsilon, \quad\text{where}\quad w^2 = \sup_{y \in Y}\inf_{x \in X} K_2(x,y),
\]

and $w^2$ is called the maxmin value of the game for Player II.

If an antagonistic game admits equilibrium situations, then the equilibrium values corresponding to them coincide and are equal to the maxmin value. Equilibrium strategies of an antagonistic game are maxmin and they are called optimal, while $\varepsilon$-equilibrium strategies are $\varepsilon$-maxmin and they are called $\varepsilon$-optimal. A nonzero-sum game may admit equilibrium situations with unequal equilibrium values.
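The definitions above can be illustrated on a finite game, where the infima and suprema become minima and maxima over matrix entries. The following is a minimal sketch under that assumption; the function names and the 2x2 payoff matrices are hypothetical and serve only as an illustration, they are not taken from the paper.

```python
from typing import List, Tuple

Matrix = List[List[float]]

def is_eps_equilibrium(K1: Matrix, K2: Matrix, i: int, j: int, eps: float = 0.0) -> bool:
    """Check the two inequalities of an eps-equilibrium for the pure situation (i, j)."""
    rows, cols = len(K1), len(K1[0])
    # Player I gains at most eps by switching from row i to any other row x.
    ok1 = all(K1[x][j] - eps <= K1[i][j] for x in range(rows))
    # Player II gains at most eps by switching from column j to any other column y.
    ok2 = all(K2[i][y] - eps <= K2[i][j] for y in range(cols))
    return ok1 and ok2

def maxmin_values(K1: Matrix, K2: Matrix) -> Tuple[float, float]:
    """Best guaranteed payoffs (w1, w2) of the two players in pure strategies."""
    rows, cols = len(K1), len(K1[0])
    w1 = max(min(K1[x][y] for y in range(cols)) for x in range(rows))
    w2 = max(min(K2[x][y] for x in range(rows)) for y in range(cols))
    return w1, w2

# Hypothetical nonzero-sum example (a prisoner's-dilemma-like game).
K1 = [[3.0, 0.0], [5.0, 1.0]]  # payoffs of Player I (rows are his strategies)
K2 = [[3.0, 5.0], [0.0, 1.0]]  # payoffs of Player II (columns are his strategies)
```

With `eps = 0` the check reduces to the ordinary equilibrium condition; a positive `eps` admits situations in which a deviation gains no more than `eps`.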

Let us introduce a partial order relation $\succ$ in $\mathbb{R}^2$:

\[
a \succ b, \quad\text{if}\quad a_j \ge b_j \ (j = 1, 2),
\]

where at least one of the inequalities is strict. Denote by $S$ the set of all situations $s = (x,y)$. The relation of Pareto preference on the set $S$ is defined as follows. Let $s^i = (x^i,y^i)$ ($i = 1, 2$). Then

\[
s^1 \succ s^2 \quad\text{if}\quad K^1 \succ K^2, \quad\text{where}\quad K^i = (K_1(x^i,y^i), K_2(x^i,y^i)), \ i = 1, 2. \tag{3}
\]

A situation $s^p = (x_p,y_p)$ is called Pareto-optimal if there are no situations $s = (x,y)$ such that $s \succ s^p$.

In an antagonistic game all the situations are Pareto-optimal. A game is called quasi-antagonistic, or a game with opposite interests, if all the situations of the game are Pareto-optimal, i.e., if for any two situations $(x^1,y^1), (x^2,y^2) \in S$ the following conditions hold:

\[
\begin{aligned}
K_1(x^1,y^1) < K_1(x^2,y^2) &\iff K_2(x^1,y^1) > K_2(x^2,y^2);\\
K_1(x^1,y^1) = K_1(x^2,y^2) &\iff K_2(x^1,y^1) = K_2(x^2,y^2).
\end{aligned}
\tag{4}
\]
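For a finite list of payoff vectors the Pareto preference and the quasi-antagonism condition can be checked directly. The sketch below assumes the situations are given by their payoff pairs; the helper names `dominates`, `pareto_optimal` and `is_quasi_antagonistic` are hypothetical.

```python
from typing import List, Sequence

def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """a is Pareto-preferred to b: no coordinate is worse and at least one is strictly better."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))

def pareto_optimal(payoffs: List[Sequence[float]]) -> List[int]:
    """Indices of payoff vectors that are not dominated by any other vector in the list."""
    return [i for i, p in enumerate(payoffs)
            if not any(dominates(q, p) for q in payoffs)]

def is_quasi_antagonistic(payoffs: List[Sequence[float]]) -> bool:
    """All situations are Pareto-optimal, i.e. the players' interests are opposite."""
    return len(pareto_optimal(payoffs)) == len(payoffs)
```

For instance, the payoff list of a zero-sum game such as `[(1, -1), (0, 0), (-1, 1)]` passes the check, while `[(1, 1), (0, 0)]` does not, because the first situation is better for both players.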

3 Posing the problem 

Consider a nonzero-sum game of two players which is a generalization of the antagonistic noisy duel with discrete firing.

The players compete under conditions of complete information. They have discrete resources $m, n \in \mathbb{N}$ which they use during the time interval $[0,1]$. The effectiveness of the $j$-th player's use of his resource is described by the function $P_j(t)$ ($j = 1, 2$), which defines the probability of achieving success when using a unit of resource at the moment $t$. The functions $P_j(t)$ are continuous and increasing, $P_j(0) = 0$, $P_j(1) = 1$, $0 < P_j(t) < 1$ for $t \in (0,1)$. If one of the players achieves success, the game stops. If a player has used all of his resource and has not achieved success, the other player postpones his action until the moment $t = 1$, when his probability of success is equal to one. The profit of the $j$-th player in the case of his success is equal to $A_j$, and his loss in the case of his opponent's success is equal to $B_j$, where

\[
A_j \ge 0, \quad B_j \ge 0, \quad A_j + B_j > 0, \quad j = 1, 2. \tag{5}
\]

The players' profits are equal to 0 if neither of them achieved success or if success was achieved by both of them simultaneously. A player's strategy is a function assigning the moment of the next action to a pair of amounts of the players' current resources. Let us call the described game a noisy nonzero-sum duel with discrete firing.

Denote by $\tau_i$ ($0 \le \tau_i \le \tau_{i-1} \le 1$, $i = 1, 2, \ldots, m$) the moments of time when Player I uses his resource. Analogously, denote by $\eta_i$ ($0 \le \eta_i \le \eta_{i-1} \le 1$, $i = 1, 2, \ldots, n$) the moments of time when Player II uses his resource. The vectors $\tau$ and $\eta$ will be called the vectors of action moments of the players. The payoff function $K_j(\tau,\eta)$ is the mathematical expectation of the profit received by the $j$-th player when the players use their resources at the moments of time $\tau_i$, $\eta_l$ ($i = 1, \ldots, m$; $l = 1, \ldots, n$). It is computed in the following way. If $m = 0$, $n = 0$, then $K_1 = K_2 = 0$. If $m \ge 1$, $n = 0$, then $K_1 = A_1$, $K_2 = -B_2$. If $m = 0$, $n \ge 1$, then $K_1 = -B_1$, $K_2 = A_2$. Assume that $m \ge 1$, $n \ge 1$. Set

\[
\tau' = (\tau_{m-1}, \ldots, \tau_1); \quad \eta' = (\eta_{n-1}, \ldots, \eta_1).
\]

Then

\[
K_1(\tau,\eta) =
\begin{cases}
A_1P_1(\tau_m) + (1 - P_1(\tau_m))K_1(\tau',\eta), & \text{if } \tau_m < \eta_n,\\[2pt]
A_1P_1(\tau_m)(1 - P_2(\tau_m)) - B_1(1 - P_1(\tau_m))P_2(\tau_m) + {}\\
\quad {} + (1 - P_1(\tau_m))(1 - P_2(\tau_m))K_1(\tau',\eta'), & \text{if } \tau_m = \eta_n,\\[2pt]
-B_1P_2(\eta_n) + (1 - P_2(\eta_n))K_1(\tau,\eta'), & \text{if } \tau_m > \eta_n;
\end{cases}
\tag{6}
\]

\[
K_2(\tau,\eta) =
\begin{cases}
A_2P_2(\eta_n) + (1 - P_2(\eta_n))K_2(\tau,\eta'), & \text{if } \eta_n < \tau_m,\\[2pt]
A_2P_2(\tau_m)(1 - P_1(\tau_m)) - B_2(1 - P_2(\tau_m))P_1(\tau_m) + {}\\
\quad {} + (1 - P_1(\tau_m))(1 - P_2(\tau_m))K_2(\tau',\eta'), & \text{if } \tau_m = \eta_n,\\[2pt]
-B_2P_1(\tau_m) + (1 - P_1(\tau_m))K_2(\tau',\eta), & \text{if } \eta_n > \tau_m.
\end{cases}
\tag{7}
\]

Let us denote $A = (A_1, A_2)$, $B = (B_1, B_2)$ and call $A$ the profit vector and $B$ the loss vector of the players. Introduce the vector-function of effectiveness $P(t) = (P_1(t), P_2(t))$. Let us denote the described game by $\Gamma_{mn}(P, A, B)$ or, for brevity, by $\Gamma_{mn}$, and its mixed extension by $\bar\Gamma_{mn}(P, A, B)$ (or $\bar\Gamma_{mn}$).

Suppose that in the duel $\Gamma_{mn}(P, A, B)$ the profit and loss vectors are related by the equations $A_1 = B_2$, $A_2 = B_1$, i.e., the profit of each player is equal to his opponent's loss. Then it follows from the relations (6), (7) that $K_1(\tau,\eta) = -K_2(\tau,\eta)$, i.e., under these conditions the game is antagonistic.
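The recursive definition of the payoff functions can be sketched as follows. This is only an illustration under stated assumptions: the action moments are passed in chronological order (first action first, that is, in the order $\tau_m, \ldots, \tau_1$), the function name `payoffs` and the accuracy functions `P1`, `P2` are hypothetical, and in the case of simultaneous actions each player is credited only when he alone succeeds, as described in the text.

```python
from typing import Callable, Sequence, Tuple

Prob = Callable[[float], float]

def payoffs(tau: Sequence[float], eta: Sequence[float], P1: Prob, P2: Prob,
            A: Tuple[float, float], B: Tuple[float, float]) -> Tuple[float, float]:
    """Expected payoffs (K1, K2); tau and eta list the remaining action moments, earliest first."""
    if not tau and not eta:
        return 0.0, 0.0
    if not eta:                      # Player II is out of resource: Player I succeeds at t = 1
        return A[0], -B[1]
    if not tau:                      # Player I is out of resource: Player II succeeds at t = 1
        return -B[0], A[1]
    t, u = tau[0], eta[0]
    if t < u:                        # Player I acts first
        p = P1(t)
        k1, k2 = payoffs(tau[1:], eta, P1, P2, A, B)
        return A[0] * p + (1 - p) * k1, -B[1] * p + (1 - p) * k2
    if t > u:                        # Player II acts first
        q = P2(u)
        k1, k2 = payoffs(tau, eta[1:], P1, P2, A, B)
        return -B[0] * q + (1 - q) * k1, A[1] * q + (1 - q) * k2
    p, q = P1(t), P2(t)              # simultaneous actions
    k1, k2 = payoffs(tau[1:], eta[1:], P1, P2, A, B)
    both_miss = (1 - p) * (1 - q)
    return (A[0] * p * (1 - q) - B[0] * (1 - p) * q + both_miss * k1,
            A[1] * q * (1 - p) - B[1] * (1 - q) * p + both_miss * k2)

# Hypothetical accuracy functions: continuous, increasing, P(0) = 0, P(1) = 1.
P1 = lambda t: t
P2 = lambda t: t * t
```

With $A_1 = B_2$ and $A_2 = B_1$, for example `A = (2.0, 3.0)` and `B = (3.0, 2.0)`, the two returned values sum to zero up to rounding for any play, which matches the antagonistic case.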

4 Situations of $\varepsilon$-equilibrium

Lemma 1 (Fox, Kimeldorf [3]). There exists a set $\{t_{ij} : i, j \in \mathbb{N}\}$ such that

\[
\prod_{i=1}^{m}(1 - P_1(t_{in})) + \prod_{j=1}^{n}(1 - P_2(t_{mj})) = 1, \tag{8}
\]

and for all $m, n$ the following inequalities hold:

\[
0 < t_{mn} < \min\{t_{m-1,n}, t_{m,n-1}\}, \quad\text{where}\quad t_{0n} = t_{m0} = 1.
\]
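The moments $t_{mn}$ of Lemma 1 can be found numerically one at a time, because for fixed $(m, n)$ the only unknown in equation (8) is $t_{mn}$ itself once the moments with smaller indices are known. The sketch below uses bisection; this numerical scheme and the name `duel_times` are assumptions of the illustration, not the method of [3].

```python
from math import prod
from typing import Callable, Dict, Tuple

def duel_times(P1: Callable[[float], float], P2: Callable[[float], float],
               m_max: int, n_max: int, tol: float = 1e-12) -> Dict[Tuple[int, int], float]:
    """Solve equation (8) for t[m, n], 1 <= m <= m_max, 1 <= n <= n_max, by bisection."""
    t: Dict[Tuple[int, int], float] = {}
    for m in range(1, m_max + 1):
        for n in range(1, n_max + 1):
            # Factors of the two products in (8) that are already known.
            a = prod(1 - P1(t[i, n]) for i in range(1, m))
            b = prod(1 - P2(t[m, j]) for j in range(1, n))
            lo, hi = 0.0, 1.0
            while hi - lo > tol:
                mid = (lo + hi) / 2
                # The left-hand side of (8) decreases in t because P1, P2 increase.
                if a * (1 - P1(mid)) + b * (1 - P2(mid)) > 1:
                    lo = mid
                else:
                    hi = mid
            t[m, n] = (lo + hi) / 2
    return t
```

For the linear accuracy functions $P_1(t) = P_2(t) = t$ the products in (8) telescope and the solution is $t_{mn} = 1/(m+n)$, which gives a convenient check of the routine.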

Set

\[
\lambda = \min\{1/(A_1 + B_1),\ 1/(A_2 + B_2)\}/2. \tag{9}
\]

Choose $\varepsilon > 0$ and find $\delta_j$ ($j = 1, 2$) such that

trn,n—l 

) and P,{6j) < Pjitmn) + Xe. (10) 

Take $\delta = \min\{\delta_1, \delta_2\}$. Let $\mu, \nu$ be the current values of the players' resources and $\nu_{\mu\nu}$ be the uniform distribution concentrated in the interval $[t_{\mu\nu}, t_{\mu\nu} + \delta]$. Define the players' mixed strategies $x^\varepsilon$, $y^\varepsilon$ in the following way. The strategy $x^\varepsilon$ ($y^\varepsilon$) prescribes to choose the next moment of action $\tau_\mu$ ($\eta_\nu$) at random according to the distribution $\nu_{\mu\nu}$.
Theorem 1. The situations $(x^\varepsilon, y^\varepsilon)$ in the game $\bar\Gamma_{mn}(P, A, B)$ are $\varepsilon$-equilibrium situations. The vector $v_{mn} = (v^1_{mn}, v^2_{mn})$ defined by the formulas

\[
\begin{aligned}
v^1_{mn} &= A_1 - (A_1 + B_1)\prod_{i=1}^{m}(1 - P_1(t_{in})) = (A_1 + B_1)\prod_{j=1}^{n}(1 - P_2(t_{mj})) - B_1;\\
v^2_{mn} &= (A_2 + B_2)\prod_{i=1}^{m}(1 - P_1(t_{in})) - B_2 = A_2 - (A_2 + B_2)\prod_{j=1}^{n}(1 - P_2(t_{mj}))
\end{aligned}
\tag{11}
\]

is the corresponding equilibrium value.

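Proof. The second equations in both lines of (11) hold by Lemma 1. Let us obtain recurrence relations for $v_{mn}$. By (11) we have:

\[
\begin{aligned}
v^1_{mn} &= A_1 - (A_1 + B_1)\prod_{i=1}^{m}(1 - P_1(t_{in})) = \\
&= A_1 - (A_1 + B_1)(1 - P_1(t_{mn}))\prod_{i=1}^{m-1}(1 - P_1(t_{in})) = \\
&= A_1 - (1 - P_1(t_{mn}))(A_1 - v^1_{m-1,n}).
\end{aligned}
\]

Finally we have an expression for $v^1_{mn}$ in terms of $v^1_{m-1,n}$:

\[
\begin{aligned}
v^1_{0n} &= -B_1 \quad\text{for } n > 0;\\
v^1_{mn} &= A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))\,v^1_{m-1,n} \quad\text{for } m > 0,\ n > 0.
\end{aligned}
\tag{12}
\]

Now obtain an expression for $v^1_{mn}$ in terms of $v^1_{m,n-1}$. By (11) we have:

\[
\begin{aligned}
v^1_{mn} &= (A_1 + B_1)\prod_{j=1}^{n}(1 - P_2(t_{mj})) - B_1 = \\
&= (A_1 + B_1)(1 - P_2(t_{mn}))\prod_{j=1}^{n-1}(1 - P_2(t_{mj})) - B_1 = \\
&= (1 - P_2(t_{mn}))(v^1_{m,n-1} + B_1) - B_1.
\end{aligned}
\]

Hence we get:

\[
\begin{aligned}
v^1_{m0} &= A_1 \quad\text{for } m > 0;\\
v^1_{mn} &= -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))\,v^1_{m,n-1} \quad\text{for } m > 0,\ n > 0.
\end{aligned}
\tag{13}
\]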
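Analogously we deduce recurrence relations for $v^2_{mn}$ in terms of $v^2_{m,n-1}$ and $v^2_{m-1,n}$:

\[
\begin{aligned}
v^2_{m0} &= -B_2 \quad\text{for } m > 0;\\
v^2_{mn} &= A_2P_2(t_{mn}) + (1 - P_2(t_{mn}))\,v^2_{m,n-1} \quad\text{for } m > 0,\ n > 0.
\end{aligned}
\tag{14}
\]

\[
\begin{aligned}
v^2_{0n} &= A_2 \quad\text{for } n > 0;\\
v^2_{mn} &= -B_2P_1(t_{mn}) + (1 - P_1(t_{mn}))\,v^2_{m-1,n} \quad\text{for } m > 0,\ n > 0.
\end{aligned}
\tag{15}
\]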
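As a sanity check, the closed formula for the first player's equilibrium value and its two recurrences can be compared in exact arithmetic. The sketch assumes the linear accuracy functions $P_1(t) = P_2(t) = t$, for which $t_{mn} = 1/(m+n)$ satisfies equation (8); the function names are hypothetical.

```python
from fractions import Fraction
from math import prod

def t(m: int, n: int) -> Fraction:
    """Moments t_mn for the assumed case P1(t) = P2(t) = t; t_0n = t_m0 = 1."""
    return Fraction(1) if m == 0 or n == 0 else Fraction(1, m + n)

def v1(m: int, n: int, A1, B1) -> Fraction:
    """Closed formula v1_mn = A1 - (A1 + B1) * prod_i (1 - P1(t_in)) with P1(t) = t."""
    miss_all = prod((1 - t(i, n) for i in range(1, m + 1)), start=Fraction(1))
    return A1 - (A1 + B1) * miss_all

def check_recurrences(m: int, n: int, A1, B1) -> bool:
    """Compare the closed formula with the recurrences in m and in n (m, n >= 1)."""
    p = t(m, n)                                         # P1(t_mn) = P2(t_mn) here
    rec_in_m = A1 * p + (1 - p) * v1(m - 1, n, A1, B1)   # Player I acts first and may miss
    rec_in_n = -B1 * p + (1 - p) * v1(m, n - 1, A1, B1)  # Player II acts first and may miss
    return v1(m, n, A1, B1) == rec_in_m == rec_in_n
```

In this special case the value simplifies to $(A_1 m - B_1 n)/(m+n)$, so a player with more resource or a larger profit has the larger guaranteed payoff.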

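To prove that $(x^\varepsilon, y^\varepsilon)$ are $\varepsilon$-equilibrium situations it is necessary and sufficient to verify the following relations:

\[
\bar K_1(\tau, y^\varepsilon) \le \bar K_1(x^\varepsilon, y^\varepsilon) + \varepsilon \quad\text{for any } \tau; \tag{16}
\]
\[
\bar K_2(x^\varepsilon, \eta) \le \bar K_2(x^\varepsilon, y^\varepsilon) + \varepsilon \quad\text{for any } \eta; \tag{17}
\]
\[
\lim_{\varepsilon \to 0} \bar K_1(x^\varepsilon, y^\varepsilon) = v^1_{mn}; \tag{18}
\]
\[
\lim_{\varepsilon \to 0} \bar K_2(x^\varepsilon, y^\varepsilon) = v^2_{mn}. \tag{19}
\]

Lemma 2. For any $\varepsilon > 0$ there exist such strategies $x^\varepsilon$, $y^\varepsilon$ that for any pure strategies $\tau$ and $\eta$ of Players I and II the following inequalities hold:

\[
\bar K_1(\tau, y^\varepsilon) < v^1_{mn} + \varepsilon; \tag{20}
\]
\[
\bar K_1(x^\varepsilon, \eta) > v^1_{mn} - \varepsilon. \tag{21}
\]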

Proof. For $n = 0$ and arbitrary $m > 0$ by the definition of the payoff function we have:

\[
K_1 = A_1; \quad K_2 = -B_2.
\]

Analogously for $m = 0$ and arbitrary $n > 0$:

\[
K_1 = -B_1; \quad K_2 = A_2.
\]

By (11) we have:

\[
v^1_{m0} = A_1; \quad v^2_{m0} = -B_2; \quad v^1_{0n} = -B_1; \quad v^2_{0n} = A_2.
\]

In both cases the inequalities (20), (21) hold.

For arbitrary $m > 0$, $n > 0$ we proceed by induction on the number of action moments of the players. Assume that the lemma is true for all pairs $(\mu, \nu)$ for which $\mu \le m$, $\nu \le n$, $(\mu, \nu) \ne (m, n)$ and prove it for $(\mu, \nu) = (m, n)$.
Every pure strategy $\tau$ of Player I has the following structure. Let $t \in [0, 1]$ denote the planned moment of his first action. If Player II acts and misses at a time $\eta_n < t$, then Player I follows a pure strategy $\tau^1$ in $\Gamma_{m,n-1}$. If Player II does not act before the time $t$, then Player I acts at the time $t$, and unless Player II also acts at the time $t$, Player I after that adopts a pure strategy $\tau^2$ in $\Gamma_{m-1,n}$.

Every pure strategy $\eta$ of Player II has a similar structure. Let $u \in [0, 1]$ denote the planned moment of his first action. If Player I acts and misses at a time $\tau_m < u$, then Player II follows a pure strategy $\eta^2$ in $\Gamma_{m-1,n}$. If Player I does not act before the time $u$, then Player II acts at the time $u$, and unless Player I also acts at the time $u$, Player II after that adopts a pure strategy $\eta^1$ in $\Gamma_{m,n-1}$.

The strategy $y^\varepsilon$ of Player II is constructed as follows. Fix $\varepsilon > 0$. Choose $t$ randomly according to $\nu_{mn}$. If Player I does not act before the time $t$, then Player II acts at the time $\eta_n = t$ and, unless Player I also acts at the time $t$, then adopts the strategy $y^{1,\varepsilon}$ in $\bar\Gamma_{m,n-1}$. If Player I acts and misses at the time $\tau_m < t$, then Player II adopts the strategy $y^{2,\varepsilon}$ in $\bar\Gamma_{m-1,n}$. According to the inductive assumption, we choose $y^{1,\varepsilon}$, $y^{2,\varepsilon}$ such that for any pure strategies $\tau^1$ and $\tau^2$ of Player I the following inequalities hold:

\[
\bar K_1(\tau^1, y^{1,\varepsilon}) < v^1_{m,n-1} + \varepsilon/2; \tag{22}
\]
\[
\bar K_1(\tau^2, y^{2,\varepsilon}) < v^1_{m-1,n} + \varepsilon/2. \tag{23}
\]

The strategy $x^\varepsilon$ of Player I is constructed similarly. Choose $u$ randomly according to $\nu_{mn}$. If Player II does not act before the time $u$, then Player I acts at the time $\tau_m = u$ and, unless Player II also acts at the time $u$, adopts the strategy $x^{2,\varepsilon}$ in $\bar\Gamma_{m-1,n}$. If Player II acts and misses at the time $\eta_n < u$, then Player I adopts the strategy $x^{1,\varepsilon}$ in $\bar\Gamma_{m,n-1}$. According to the inductive assumption, we choose $x^{1,\varepsilon}$, $x^{2,\varepsilon}$ such that for any pure strategies $\eta^1$ and $\eta^2$ of Player II the following inequalities hold:

\[
\bar K_1(x^{2,\varepsilon}, \eta^2) > v^1_{m-1,n} - \varepsilon/2; \tag{24}
\]
\[
\bar K_1(x^{1,\varepsilon}, \eta^1) > v^1_{m,n-1} - \varepsilon/2. \tag{25}
\]

For all strategies described above we have ignored the response to simultaneous actions of the players, since this event has probability 0 when Player I adopts $x^\varepsilon$ or Player II adopts $y^\varepsilon$.

Let us prove the inequality (20).
Note that by the definition of the payoff function for any strategies $x$, $y$ we have:

\[
-B_j \le K_j(x,y) \le A_j \ (j = 1, 2) \implies
\]
\[
K_j(x,y) + B_j \ge 0; \tag{26}
\]
\[
A_j - K_j(x,y) \ge 0. \tag{27}
\]

Let $\tau$ be an arbitrary pure strategy of Player I and $t$ be the time of the first action in $\tau$. There are three cases to be considered.

1. Suppose $t \in [0, t_{mn}]$. In this case Player I always acts first; so

\[
\bar K_1(\tau, y^\varepsilon) = A_1P_1(t) + (1 - P_1(t))\bar K_1(\tau^2, y^{2,\varepsilon}). \tag{28}
\]

One can see from the inequality (27) that the right hand side of (28) does not exceed

\[
(A_1 - \bar K_1(\tau^2, y^{2,\varepsilon}))P_1(t_{mn}) + \bar K_1(\tau^2, y^{2,\varepsilon}) = A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))\bar K_1(\tau^2, y^{2,\varepsilon}).
\]

According to (23), the latter expression is less than

\[
A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))(v^1_{m-1,n} + \varepsilon/2).
\]

Using the recurrence relation (12) we get:

\[
A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))(v^1_{m-1,n} + \varepsilon/2) = v^1_{mn} + (1 - P_1(t_{mn}))\varepsilon/2.
\]

Therefore

\[
\bar K_1(\tau, y^\varepsilon) < v^1_{mn} + \varepsilon.
\]

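2. Suppose $t \in (t_{mn}, t_{mn} + \delta)$. In this case either player may act first; so

\[
\begin{aligned}
\bar K_1(\tau, y^\varepsilon) &= \int_{t_{mn}}^{t} \bigl(-B_1P_2(\xi) + (1 - P_2(\xi))\bar K_1(\tau^1, y^{1,\varepsilon})\bigr)\,d\nu_{mn}(\xi) + {}\\
&\quad {} + \int_{t}^{t_{mn}+\delta} \bigl(A_1P_1(t) + (1 - P_1(t))\bar K_1(\tau^2, y^{2,\varepsilon})\bigr)\,d\nu_{mn}(\xi).
\end{aligned}
\]

Estimate the first integrand using the relations (26), (22), and (13):

\[
\begin{aligned}
-B_1P_2(\xi) + (1 - P_2(\xi))\bar K_1(\tau^1, y^{1,\varepsilon})
&= \bar K_1(\tau^1, y^{1,\varepsilon}) - P_2(\xi)(\bar K_1(\tau^1, y^{1,\varepsilon}) + B_1) \le \\
&\le \bar K_1(\tau^1, y^{1,\varepsilon}) - P_2(t_{mn})(\bar K_1(\tau^1, y^{1,\varepsilon}) + B_1) = \\
&= -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))\bar K_1(\tau^1, y^{1,\varepsilon}) < \\
&< -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))(v^1_{m,n-1} + \varepsilon/2) = \\
&= v^1_{mn} + (1 - P_2(t_{mn}))\varepsilon/2 < v^1_{mn} + \varepsilon.
\end{aligned}
\]

Estimate the second integrand using the relations (27), (26), (9), (10), (23), and (12):

\[
\begin{aligned}
A_1P_1(t) + (1 - P_1(t))\bar K_1(\tau^2, y^{2,\varepsilon})
&= (A_1 - \bar K_1(\tau^2, y^{2,\varepsilon}))P_1(t) + \bar K_1(\tau^2, y^{2,\varepsilon}) \le \\
&\le (A_1 - \bar K_1(\tau^2, y^{2,\varepsilon}))(P_1(t_{mn}) + \lambda\varepsilon) + \bar K_1(\tau^2, y^{2,\varepsilon}) \le \\
&\le A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))\bar K_1(\tau^2, y^{2,\varepsilon}) + (A_1 + B_1)\lambda\varepsilon < \\
&< A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))(v^1_{m-1,n} + \varepsilon/2) + \varepsilon/2 = \\
&= v^1_{mn} + (1 - P_1(t_{mn}))\varepsilon/2 + \varepsilon/2 < v^1_{mn} + \varepsilon.
\end{aligned}
\]

After integrating we get:

\[
\bar K_1(\tau, y^\varepsilon) < v^1_{mn} + \varepsilon.
\]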

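3. Suppose $t \in [t_{mn} + \delta, 1]$. In this case Player II always acts first; so

\[
\bar K_1(\tau, y^\varepsilon) = \int_{t_{mn}}^{t_{mn}+\delta} \bigl(-B_1P_2(\xi) + (1 - P_2(\xi))\bar K_1(\tau^1, y^{1,\varepsilon})\bigr)\,d\nu_{mn}(\xi).
\]

Estimate the integrand using the relations (26), (22), (13):

\[
\begin{aligned}
-B_1P_2(\xi) + (1 - P_2(\xi))\bar K_1(\tau^1, y^{1,\varepsilon})
&= \bar K_1(\tau^1, y^{1,\varepsilon}) - (\bar K_1(\tau^1, y^{1,\varepsilon}) + B_1)P_2(\xi) \le \\
&\le \bar K_1(\tau^1, y^{1,\varepsilon}) - (\bar K_1(\tau^1, y^{1,\varepsilon}) + B_1)P_2(t_{mn}) = \\
&= -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))\bar K_1(\tau^1, y^{1,\varepsilon}) < \\
&< -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))(v^1_{m,n-1} + \varepsilon/2) < v^1_{mn} + \varepsilon.
\end{aligned}
\]

After integrating we have:

\[
\bar K_1(\tau, y^\varepsilon) < v^1_{mn} + \varepsilon.
\]

Hence the inequality (20) is proved.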

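Now let us turn to the inequality (21). Let $\eta$ be an arbitrary pure strategy of Player II and $u = \eta_n$ be the first action time in $\eta$.

There are three cases to be considered.

1. Suppose $u \in [0, t_{mn}]$. In this case Player II acts first; so

\[
\bar K_1(x^\varepsilon, \eta) = -B_1P_2(u) + (1 - P_2(u))\bar K_1(x^{1,\varepsilon}, \eta^1).
\]

By the relations (26), (25), and (13), we have:

\[
\begin{aligned}
\bar K_1(x^\varepsilon, \eta)
&= \bar K_1(x^{1,\varepsilon}, \eta^1) - (\bar K_1(x^{1,\varepsilon}, \eta^1) + B_1)P_2(u) \ge \\
&\ge -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))\bar K_1(x^{1,\varepsilon}, \eta^1) > \\
&> -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))(v^1_{m,n-1} - \varepsilon/2) = \\
&= v^1_{mn} - (1 - P_2(t_{mn}))\varepsilon/2 > v^1_{mn} - \varepsilon.
\end{aligned}
\]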



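2. Suppose $u \in (t_{mn}, t_{mn} + \delta)$. In this case either player may act first; so

\[
\begin{aligned}
\bar K_1(x^\varepsilon, \eta) &= \int_{t_{mn}}^{u} \bigl(A_1P_1(\xi) + (1 - P_1(\xi))\bar K_1(x^{2,\varepsilon}, \eta^2)\bigr)\,d\nu_{mn}(\xi) + {}\\
&\quad {} + \int_{u}^{t_{mn}+\delta} \bigl(-B_1P_2(u) + (1 - P_2(u))\bar K_1(x^{1,\varepsilon}, \eta^1)\bigr)\,d\nu_{mn}(\xi).
\end{aligned}
\]

Estimate the first integrand using the relations (27), (24), and (12):

\[
\begin{aligned}
A_1P_1(\xi) + (1 - P_1(\xi))\bar K_1(x^{2,\varepsilon}, \eta^2)
&= (A_1 - \bar K_1(x^{2,\varepsilon}, \eta^2))P_1(\xi) + \bar K_1(x^{2,\varepsilon}, \eta^2) \ge \\
&\ge (A_1 - \bar K_1(x^{2,\varepsilon}, \eta^2))P_1(t_{mn}) + \bar K_1(x^{2,\varepsilon}, \eta^2) = \\
&= A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))\bar K_1(x^{2,\varepsilon}, \eta^2) > \\
&> A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))(v^1_{m-1,n} - \varepsilon/2) = \\
&= v^1_{mn} - (1 - P_1(t_{mn}))\varepsilon/2 > v^1_{mn} - \varepsilon.
\end{aligned}
\]

Estimate the second integrand using the relations (26), (27), (9), (10), (25), and (13):

\[
\begin{aligned}
-B_1P_2(u) + (1 - P_2(u))\bar K_1(x^{1,\varepsilon}, \eta^1)
&= \bar K_1(x^{1,\varepsilon}, \eta^1) - (\bar K_1(x^{1,\varepsilon}, \eta^1) + B_1)P_2(u) \ge \\
&\ge \bar K_1(x^{1,\varepsilon}, \eta^1) - (\bar K_1(x^{1,\varepsilon}, \eta^1) + B_1)(P_2(t_{mn}) + \lambda\varepsilon) \ge \\
&\ge -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))\bar K_1(x^{1,\varepsilon}, \eta^1) - (A_1 + B_1)\lambda\varepsilon > \\
&> -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))(v^1_{m,n-1} - \varepsilon/2) - \varepsilon/2 = \\
&= v^1_{mn} - (1 - P_2(t_{mn}))\varepsilon/2 - \varepsilon/2 > v^1_{mn} - \varepsilon.
\end{aligned}
\]

After integrating we get:

\[
\bar K_1(x^\varepsilon, \eta) > v^1_{mn} - \varepsilon.
\]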

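3. Suppose $u \in [t_{mn} + \delta, 1]$. In this case Player I acts first; so by the relations (27), (24), and (12), we have:

\[
\begin{aligned}
\bar K_1(x^\varepsilon, \eta)
&= \int_{t_{mn}}^{t_{mn}+\delta} \bigl((A_1 - \bar K_1(x^{2,\varepsilon}, \eta^2))P_1(\xi) + \bar K_1(x^{2,\varepsilon}, \eta^2)\bigr)\,d\nu_{mn}(\xi) \ge \\
&\ge A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))\bar K_1(x^{2,\varepsilon}, \eta^2) > \\
&> A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))(v^1_{m-1,n} - \varepsilon/2) = \\
&= v^1_{mn} - (1 - P_1(t_{mn}))\varepsilon/2 > v^1_{mn} - \varepsilon/2.
\end{aligned}
\]

Hence the inequality (21) is proved. □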

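Let us continue to prove the theorem. Choose $\varepsilon > 0$. By Lemma 2, we can find a strategy $y^\varepsilon$ satisfying (20) for any pure strategy $\tau$ of Player I. From the inequality (20) it follows that

\[
\bar K_1(x, y^\varepsilon) < v^1_{mn} + \varepsilon \tag{29}
\]

for any mixed strategy $x$ of Player I. In particular for $x = x^\varepsilon$ we have:

\[
\bar K_1(x^\varepsilon, y^\varepsilon) < v^1_{mn} + \varepsilon \implies v^1_{mn} > \bar K_1(x^\varepsilon, y^\varepsilon) - \varepsilon. \tag{30}
\]

From the inequality (21) it follows that

\[
\bar K_1(x^\varepsilon, y) > v^1_{mn} - \varepsilon \tag{31}
\]

for any mixed strategy $y$ of Player II. In particular for $y = y^\varepsilon$ we have:

\[
\bar K_1(x^\varepsilon, y^\varepsilon) > v^1_{mn} - \varepsilon \implies v^1_{mn} < \bar K_1(x^\varepsilon, y^\varepsilon) + \varepsilon. \tag{32}
\]

Taking into account the relations (29), (32) we obtain

\[
\bar K_1(x, y^\varepsilon) < v^1_{mn} + \varepsilon < \bar K_1(x^\varepsilon, y^\varepsilon) + 2\varepsilon
\]

for any mixed strategy $x$ of Player I. Since $\varepsilon > 0$ is arbitrary, carrying out the construction with $\varepsilon/2$ in place of $\varepsilon$ gives the inequality (16). The relation (18) follows from (30) and (32).

Due to the symmetry of the setting the relations (17), (19) are proved by the same arguments as (16), (18). □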

Theorem 2. The maxmin value of the game $\bar\Gamma_{mn}(P, A, B)$ coincides with the equilibrium value $v_{mn}$. The $\varepsilon$-equilibrium strategies $x^\varepsilon$, $y^\varepsilon$ are $\varepsilon$-maxmin strategies.

Proof. Let us use the properties of the zero-sum duel $\Gamma_{mn}(P, C, \bar C)$, where $C_1 = A_1$, $C_2 = B_1$, $C = (C_1, C_2)$, $\bar C = (C_2, C_1)$. In an antagonistic game the equilibrium value coincides with the maxmin value and $\varepsilon$-equilibrium strategies are $\varepsilon$-maxmin strategies. As the equilibrium value of $\Gamma_{mn}(P, C, \bar C)$ for Player I is equal to $v^1_{mn}$, the maxmin value of this game is also equal to $v^1_{mn}$ and the $\varepsilon$-equilibrium strategy $x^\varepsilon$ of Player I is his $\varepsilon$-maxmin strategy. The payoff function of Player I depends only on $A_1$ and $B_1$, so it is the same in both games. Thus the maxmin value of the game $\bar\Gamma_{mn}(P, A, B)$ for Player I is equal to $v^1_{mn}$ and $x^\varepsilon$ is his $\varepsilon$-maxmin strategy.

The statement of the theorem for Player II follows from the symmetry of the setting. □
5 Pareto-optimal plays

The pair $p = (\tau, \eta)$ of vectors of action moments realized during the game is called a play. Let us denote the set of all plays of a noisy duel $\Gamma_{mn}(P, A, B)$ by $\mathcal{P}$. Note that $\mathcal{P}$ is a subset of the set of all situations of the corresponding silent duel (with the same effectiveness functions, resources, and vectors of profit and loss). Namely, $\mathcal{P}$ includes exactly those situations of the silent duel in which after one of the players has used all of his resource, the other one postpones his action until the moment $t = 1$. A play $p^1 \in \mathcal{P}$ is called Pareto-optimal if there exists no $p^2 \in \mathcal{P}$ such that $p^2 \succ p^1$. Plays $p^1$ and $p^2$ are called incomparable if $p^1 \nsucc p^2$ and $p^2 \nsucc p^1$. Plays $p^1 = (\tau^1, \eta^1)$ and $p^2 = (\tau^2, \eta^2)$ are called equivalent if $K_j(\tau^1, \eta^1) = K_j(\tau^2, \eta^2)$ ($j = 1, 2$).

A play $p = (\tau, \eta) \in \mathcal{P}$ is called a T-play if for any $k$, $l$ ($1 \le k \le m$; $1 \le l \le n$) at least one of the following two equations holds:

\[
\tau_k = t_{kl}; \quad \eta_l = t_{kl},
\]

where $k$, $l$ are the current resources of the players.

Lemma 3. Let $p^T = (\tau^T, \eta^T)$ be an arbitrary T-play with noncoinciding action moments of the players, i.e., $\tau_k \ne \eta_l$ for all $k$ ($1 \le k \le m$), $l$ ($1 \le l \le n$). Then

\[
K_1(\tau^T, \eta^T) = v^1_{mn}; \quad K_2(\tau^T, \eta^T) = v^2_{mn}. \tag{33}
\]

Proof. Let us prove the lemma by induction on the number of action moments of the players. For $n = 0$, $m > 0$ or $m = 0$, $n > 0$ the assertion is true as $v^1_{m0} = A_1$; $v^2_{m0} = -B_2$; $v^1_{0n} = -B_1$; $v^2_{0n} = A_2$. Assume that the statement is true for all pairs $(k, l)$ such that $k \le m$; $l \le n$; $k + l < m + n$ and prove it for $(k, l) = (m, n)$. Suppose that Player I acts at the moment $\tau_m = t_{mn}$. Then using the recursive formula (6) for $\tau_m < \eta_n$ and the inductive assumptions we obtain

\[
\begin{aligned}
K_1(\tau, \eta) &= A_1P_1(t_{mn}) + (1 - P_1(t_{mn}))\Bigl(A_1 - (A_1 + B_1)\prod_{i=1}^{m-1}(1 - P_1(t_{in}))\Bigr) = \\
&= A_1 - (A_1 + B_1)\prod_{i=1}^{m}(1 - P_1(t_{in})) = v^1_{mn}.
\end{aligned}
\]

Suppose that Player II acts at the moment of time $t_{mn}$. Then using the formula (6) for $\tau_m > \eta_n$ and the inductive assumptions we obtain

\[
\begin{aligned}
K_1(\tau, \eta) &= -B_1P_2(t_{mn}) + (1 - P_2(t_{mn}))\Bigl((A_1 + B_1)\prod_{j=1}^{n-1}(1 - P_2(t_{mj})) - B_1\Bigr) = \\
&= -B_1 + (A_1 + B_1)\prod_{j=1}^{n}(1 - P_2(t_{mj})) = v^1_{mn}.
\end{aligned}
\]

The second player's payoff function is considered in the analogous way. □

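Lemma 4. Let the plays $p^1 = (\tau^1, \eta^1)$, $p^2 = (\tau^2, \eta^2) \in \mathcal{P}$ satisfy the conditions

\[
\eta^1_i = \eta^2_i, \quad \tau^1_i = \tau^2_i \quad\text{for } i \ge 2; \tag{34}
\]
\[
\tau^1_1 = t_{11}; \quad \eta^1_2 < t_{11}; \quad \eta^1_1 = 1; \quad \tau^2_1 = t_{11}; \quad \eta^2_1 = t_{11}. \tag{35}
\]

Then we have: (1) if $A \succ B$, then $p^1 \succ p^2$; (2) if $B \succ A$, then $p^2 \succ p^1$.

Proof. Express the payoff functions of the plays $p^1$ and $p^2$ in the following way:

\[
\begin{aligned}
K_1(\tau^1, \eta^1) &= K_1(\tau_m, \ldots, \tau_2; \eta_n, \ldots, \eta_2) + \prod_{i=2}^{m}(1 - P_1(\tau_i)) \times {}\\
&\quad {} \times \prod_{j=2}^{n}(1 - P_2(\eta_j))\,\bigl(A_1P_1(t_{11}) - B_1(1 - P_1(t_{11}))\bigr);\\
K_1(\tau^2, \eta^2) &= K_1(\tau_m, \ldots, \tau_2; \eta_n, \ldots, \eta_2) + \prod_{i=2}^{m}(1 - P_1(\tau_i)) \times {}\\
&\quad {} \times \prod_{j=2}^{n}(1 - P_2(\eta_j))\,\bigl(A_1P_1(t_{11})(1 - P_2(t_{11})) - B_1(1 - P_1(t_{11}))P_2(t_{11})\bigr).
\end{aligned}
\]

Consider the difference $K_1(\tau^1, \eta^1) - K_1(\tau^2, \eta^2)$. Taking into account that by Lemma 1

\[
P_1(t_{11}) + P_2(t_{11}) = 1,
\]

we have:

\[
K_1(\tau^1, \eta^1) - K_1(\tau^2, \eta^2) = (A_1 - B_1)P_1(t_{11})P_2(t_{11})\prod_{i=2}^{m}(1 - P_1(\tau_i))\prod_{j=2}^{n}(1 - P_2(\eta_j)). \tag{36}
\]

Analogously,

\[
K_2(\tau^1, \eta^1) - K_2(\tau^2, \eta^2) = (A_2 - B_2)P_1(t_{11})P_2(t_{11})\prod_{i=2}^{m}(1 - P_1(\tau_i))\prod_{j=2}^{n}(1 - P_2(\eta_j)). \tag{37}
\]

The statement of the lemma follows from (36), (37). □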

A duel may have one of four alternative results: $H_0$, neither player achieves success; $H_1$, Player I achieves success; $H_2$, Player II achieves success; $H_3$, both players achieve success simultaneously. We denote the probability of the result $H_i$ in the play $(\tau, \eta)$ by $Q_i(\tau, \eta)$ ($i = 0, 1, 2, 3$). Then:

\[
\sum_{i=0}^{3} Q_i(\tau, \eta) = 1. \tag{38}
\]

Success of both players is possible only if they act simultaneously, because if one of the players achieves success, the game stops. As the payoff function $K_j(\tau, \eta)$ is the mathematical expectation of profit,

\[
K_1(\tau, \eta) = A_1Q_1(\tau, \eta) - B_1Q_2(\tau, \eta); \tag{39}
\]
\[
K_2(\tau, \eta) = A_2Q_2(\tau, \eta) - B_2Q_1(\tau, \eta). \tag{40}
\]

By (38), we have:

\[
K_1(\tau, \eta) = A_1 - (A_1 + B_1)Q_2(\tau, \eta) - A_1(Q_0(\tau, \eta) + Q_3(\tau, \eta)); \tag{41}
\]
\[
K_2(\tau, \eta) = -B_2 + (A_2 + B_2)Q_2(\tau, \eta) + B_2(Q_0(\tau, \eta) + Q_3(\tau, \eta)). \tag{42}
\]

Denote by $\mathcal{P}' \subset \mathcal{P}$ the set of plays with noncoinciding action times in which $\tau_1 = 1$ or $\eta_1 = 1$. The complement $\mathcal{P} \setminus \mathcal{P}'$ includes plays in which the players use their last units of resource simultaneously.

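Lemma 5. Let $p' = (\tau', \eta') \in \mathcal{P}'$ and $p = (\tau, \eta) \in \mathcal{P} \setminus \mathcal{P}'$. In this case (1) if $p' \succ p$, then $A_1A_2 > B_1B_2$; (2) if $p \succ p'$, then $A_1A_2 < B_1B_2$.

Proof. Let $(\tau', \eta') \in \mathcal{P}'$; then $Q_0(\tau', \eta') = Q_3(\tau', \eta') = 0$, so the relations (41), (42) reduce to the form:

\[
K_1(\tau', \eta') = A_1 - (A_1 + B_1)Q_2(\tau', \eta'); \tag{43}
\]
\[
K_2(\tau', \eta') = -B_2 + (A_2 + B_2)Q_2(\tau', \eta'). \tag{44}
\]

Consider the differences $\Delta_j = K_j(\tau, \eta) - K_j(\tau', \eta')$. Applying the formulas (41), (42), (43), (44) we get:

\[
\Delta_1 = (A_1 + B_1)\,I(\tau', \eta'; \tau, \eta) - A_1Q(\tau, \eta); \tag{45}
\]
\[
\Delta_2 = -(A_2 + B_2)\,I(\tau', \eta'; \tau, \eta) + B_2Q(\tau, \eta), \tag{46}
\]

where

\[
I(\tau', \eta'; \tau, \eta) = Q_2(\tau', \eta') - Q_2(\tau, \eta), \quad Q(\tau, \eta) = Q_0(\tau, \eta) + Q_3(\tau, \eta).
\]

Let us prove the first assertion of the lemma. If $p' \succ p$, then $\Delta_1 \le 0$, $\Delta_2 \le 0$ and at least one of these inequalities is strict. Since $A_j + B_j > 0$ ($j = 1, 2$), it follows from (45), (46) that

\[
I(\tau', \eta'; \tau, \eta) \le \frac{A_1}{A_1 + B_1}\,Q(\tau, \eta); \tag{47}
\]
\[
I(\tau', \eta'; \tau, \eta) \ge \frac{B_2}{A_2 + B_2}\,Q(\tau, \eta). \tag{48}
\]

Since $Q(\tau, \eta) \ne 0$ for $p = (\tau, \eta) \in \mathcal{P} \setminus \mathcal{P}'$, combining the inequalities (47), (48) and taking into account that at least one of them is strict, we get:

\[
\frac{B_2}{A_2 + B_2} < \frac{A_1}{A_1 + B_1} \implies A_1A_2 > B_1B_2.
\]

The second statement of the lemma is proved in the analogous way. □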

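Lemma 6. Let $p_1, p_2 \in \mathcal{P}'$. Then $p_1$ and $p_2$ are Pareto-incomparable.

Proof. If $p = (\tau, \eta) \in \mathcal{P}'$, then from the relations (43), (44) it follows that:

\[
K_2(\tau, \eta) = \frac{A_1A_2 - B_1B_2}{A_1 + B_1} - \frac{A_2 + B_2}{A_1 + B_1}\,K_1(\tau, \eta). \tag{49}
\]

By (49), on $\mathcal{P}'$ the payoff $K_2$ is a strictly decreasing function of $K_1$, so no play in $\mathcal{P}'$ can be preferred to another one. The statement of the lemma immediately follows from (49). □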
Theorem 3. If in the game $\Gamma_{mn}(P, A, B)$ the coefficients of profit and loss of the players satisfy the condition

\[
A_1A_2 = B_1B_2, \tag{50}
\]

then the game is quasi-antagonistic.

In the paper [12] a similar theorem about a sufficient condition of quasi-antagonisticity was also proven for continuous and discrete duels.

Proof. Note that if the relation (50) holds, then there exists a number $\lambda > 0$ such that

\[
K_1(\tau, \eta) = -\lambda K_2(\tau, \eta). \tag{51}
\]

Indeed, suppose $A_1A_2 > 0$. Then according to (50) $B_1B_2 > 0$ and $A_1/B_2 = B_1/A_2$. In this case from the formulas (39), (40) one can see that (51) holds with $\lambda = A_1/B_2$. Suppose that one of the numbers $A_1$ or $A_2$ is equal to zero. If $A_1 = 0$, then by (5) we have $B_1 \ne 0$, and from (50) it follows that $B_2 = 0$, $A_2 \ne 0$. Then in view of the relations (39), (40) we conclude that the equation (51) holds with $\lambda = B_1/A_2$. The case $A_2 = 0$ is considered in a similar way. So it is proved that there exists $\lambda > 0$ for which the equation (51) holds. It follows from (51) that the game is quasi-antagonistic.

□
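The proportionality of the payoffs under the condition $A_1A_2 = B_1B_2$ can be checked in exact arithmetic. The sketch below assumes integer (or rational) coefficients that satisfy the constraints $A_j \ge 0$, $B_j \ge 0$, $A_j + B_j > 0$; the function names are hypothetical, and the outcome probabilities `Q1`, `Q2` are passed in directly rather than computed from a play.

```python
from fractions import Fraction
from typing import Tuple

def outcome_payoffs(Q1, Q2, A: Tuple[int, int], B: Tuple[int, int]):
    """Payoffs K1 = A1*Q1 - B1*Q2 and K2 = A2*Q2 - B2*Q1 from the success probabilities."""
    return A[0] * Q1 - B[0] * Q2, A[1] * Q2 - B[1] * Q1

def proportionality_factor(A: Tuple[int, int], B: Tuple[int, int]) -> Fraction:
    """Return lambda > 0 with K1 = -lambda * K2, assuming A1*A2 = B1*B2."""
    if A[0] * A[1] != B[0] * B[1]:
        raise ValueError("the condition A1*A2 = B1*B2 is not satisfied")
    # B2 = 0 is possible only when A1 = 0; then A2 > 0 and lambda = B1/A2.
    return Fraction(A[0], B[1]) if B[1] != 0 else Fraction(B[0], A[1])
```

For example, with `A = (2, 3)` and `B = (6, 1)` the factor is 2, and the identity `K1 == -2 * K2` holds for any pair of probabilities `Q1`, `Q2`.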

Theorem 4. If the coefficients of profit and loss of the players are related by the inequality $A_1A_2 \ge B_1B_2$, then T-plays with noncoinciding action moments of the players are Pareto-optimal.

Proof. If $A_1A_2 = B_1B_2$, then by Theorem 3 all plays are Pareto-optimal. In any T-play with noncoinciding action moments the last action of a player happens at $t = 1$, i.e., $\max\{\tau_1, \eta_1\} = 1$. Suppose $A_1A_2 > B_1B_2$. We will show that $p \nsucc p'$ for any plays $p \in \mathcal{P}$, $p' \in \mathcal{P}'$. There are two cases to be considered.

1. $p \in \mathcal{P}'$. Then by Lemma 6 $p \nsucc p'$.

2. $p \in \mathcal{P} \setminus \mathcal{P}'$. By Lemma 5, if $p \succ p'$ then $A_1A_2 < B_1B_2$, in contradiction to the assumption of the theorem. Therefore $p \nsucc p'$.

□
Theorem 5. If $B \succ A$, then T-plays with noncoinciding action moments of the players are not Pareto-optimal.

Proof. Since according to Lemma 3 all the T-plays with noncoinciding action moments of the players are equivalent, we will give the proof for one T-play only, namely for the play $p^1 = (\tau, \eta)$ in which

\[
\tau_i = t_{i1} \quad\text{for } i = 1, \ldots, m;
\]
\[
\eta_j = t_{mj} \quad\text{for } j = 2, \ldots, n; \quad \eta_1 = 1.
\]

Set

\[
p^2 = (\tau, \eta^2), \quad\text{where}\quad \eta^2 = (t_{11}, \eta_2, \ldots, \eta_n).
\]

The plays $p^1$, $p^2$ satisfy the conditions of Lemma 4, therefore

\[
B \succ A \implies p^2 \succ p^1.
\]

Thus the T-play $p^1$ and all the other T-plays with noncoinciding action moments of the players are not Pareto-optimal. □

Theorem 6. If the duel $\Gamma_{mn}(P, A, B)$ is quasi-antagonistic, then one of the following two conditions holds:

\[
(1)\ (A_1 - B_1)(A_2 - B_2) < 0; \qquad (2)\ A_j = B_j, \ j = 1, 2. \tag{52}
\]

Proof. Assume that the plays $p^1$, $p^2$ satisfy the conditions (34), (35). Suppose that neither of the conditions (52) holds. Then the following two cases are possible.

1. $A \succ B$. Then according to the statement (1) of Lemma 4 one has $p^1 \succ p^2$. Therefore the play $p^2$ is not Pareto-optimal, and so the game is not quasi-antagonistic.

2. $B \succ A$. Then according to the statement (2) of Lemma 4 one has $p^2 \succ p^1$. Therefore the play $p^1$ is not Pareto-optimal, and so the game is not quasi-antagonistic.

□